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O . Abstract 

Vh ' We consider deterministic and stochastic perturbations of dynamical systems 

with conservation laws in R 3 . The Landau-Lifshitz equation for the magnetization 
dynamics in ferromagnetics is a special case of our system. The averaging principle 
is a natural tool in such problems. But bifurcations in the set of invariant measures 
lead to essential modification in classical averaging. The limiting slow motion in this 
case, in general, is a stochastic process even if pure deterministic perturbations of 
a deterministic system are considered. The stochasticity is a result of instabilities 
in the non-perturbed system as well as of existence of ergodic sets of a positive 
measure. We effectively describe the limiting slow motion. 
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1 Introduction 

m 

o . 

The analytical study of magnetization dynamics governed by the Landau-Lifshitz 
equation (see [18]) has been the focus of considerable research for many years. In 
normalized form this equation reads as (see [5], equations (2.51) and (2.53)): 



X 



— = -m x h cff - am x (m x h cff ) , m(r, 0) = m (r) G M 3 , |m (r)| = 1 . (1.1) 

Here h c g is an effective field. The three-dimensional vector m(r, t) is the magneti- 
zation of the material at a fixed point r £ R 3 at time t; The term am x (m x h c g) is 
the Landau-Lifshitz damping term, < a « 1. One can check that (1.1) preserves a 
first integral F(m) = — |m| 2 . Therefore for fixed r, the system (1.1) describes a motion 
on the sphere in M 3 . 
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One can introduce an energy density function G such that VG = — h e g. Then 
equation (1.1) can be written as follows: 

— = m x VG + am x (m x VG) , m(0) = m G R 3 , |m | = 1 . (1.2) 

We assume that G is a smooth generic function. Considered on the unit sphere 
S 2 in R 3 , such a function may have three types of critical points: maxima, minima and 
saddle points. Without the damping am x (m x VG) the energy density G is preserved. 
One easily checks that VG- (am x (m x VG)) = — a|m x VG| 2 so that the damping term 
is a kind of "friction" for the system (1.2), just like the classical friction in Hamiltonian 
systems (compare with [4]). 

If < a « 1, the dynamics of (1.2) has two distinct time scales: the fast time 
scale of the precessional dynamics and the relatively slow time scale of relaxational 
dynamics caused by the small damping term am x (m x VG). Therefore it is natural 
to use the averaging principle to describe the long-time evolution of energy density G. 
However the classical averaging principle here should be modified: existence of saddle 
points of G(m) on the sphere {|m| = 1} leads to stochastic, in a certain sense, behavior 
of the slow motion even in the case of purely deterministic damping term (compare with 
[4]). Moreover, in Section 5, we consider a more general class of equations, where level 
set components of first integrals, which are compact two-dimensional surfaces may have 
topological structure different from a sphere. If genus of such a surface is positive, the 
non-perturbed system can have positive area ergodic sets. Existence of such sets lead to 
an " additional stochasticity" . Description of the stochastic process which characterizes 
the long-time evolution of the energy is one of the main goals of this paper. 

Random perturbation caused by thermal fluctuations become increasingly pro- 
nounced in nano-scale devices. To take this into account one can include in the right- 
hand side of (1.2) a small stochastic term. This stochastic term, in general, introduces 
one more time scale in the system. Interplay between the influence of small damping 
and even smaller stochastic term leads to certain changes in the metastability of the 
system. Description of the metastable distributions is another goal of this paper. There 
are some other asymptotic regimes of the Landau-Lifshitz dynamics which we mention 
briefly and we will consider them in more details elsewhere. 



2 Sketch of the paper 

In this section we give an informal sketch of the results. 

In the next two sections we consider perturbations of the following equation 



X t = VF(X t ) x VG(X t ) , X = x G R 3 , (2.1) 

which could be regarded as a generalized Landau-Lifshitz equation. 

Here G(x) and Fix), xGR 3 , are smooth enough generic functions (this means that 
each of these functions has a finite number of critical points which are assumed to be 
non-degenerate), lim F{x) = oo. The initial point xo = xq{z) is chosen in such a way 

|x|— >oo 

that F(xq(z)) = z. As before we call G{x) energy (to be precise, G{x) in (1.2) is the 
energy density but for brevity we call it energy). 

It is easy to see that F{x) and G{x) are first integrals of system (2.1). For instance, 

^^ = VF(X t ) ■ (VF(X t ) x VG(X t )) = . 

Note also that the Lebesgue measure in IR 3 (the volume) is invariant for system 
(2.1): 

div(VF(x) x VG(x)) = VG(x) • (V x VF(x)) - VF(x) • (V x VG(x)) = . 

This implies, in particular, that — — - is the density of an invariant measure of system 

|VF(x)| 

(2.1) considered on the surface S(z) = {i £ B 3 : F(x) = z] with respect to the area on 
S(z). Notice that the surface S(z) may have several connected components. For brevity 
in the next two sections, and in the rest of this section (except the last four paragraph), 
we assume that the level surface S(z) = {x G M 3 : F(x) = z} has only one connected 
component and this component is homeomorphic to S 2 . In Sections 5 and 6 we will 
drop this assumption and consider more general situations. 

As we already mentioned, the damping term in (1.2) preserves the first integral 
— |m| , so that we consider, first, perturbations of (2.1) preserving F(x). The perturbed 
equation can be written in the form 

X t = VF(X e t ) x VG(X e t ) + eVF(X £ t ) x b(X[) , X £ = x G M 3 . (2.2) 

Here b(») is a smooth vector field in R 3 . In the next two sections we assume for 
brevity that the perturbation eVF x b is of "friction" type: 

VG(x) • (VF(i) x b(x)) < , x G S(z) C M 3 . (2.3) 

Note that any vector field VF(x) x b(x) can be written in the form VF(x) x 
(V-F(x) x b(x)) for some vector field b(x) G IR 3 . Indeed, without loss of generality one 
can assume that b(x) _L VF. Each such vector b(x) can be represented as V-F(x) x b(x). 
So that the perturbed equation can be written as 
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X t = VF(Xf) x VG(X e t ) + eVF{X £ t ) x (VF(X £ t ) x h{X £ t )) , X e = x € R 3 . (2.4) 

Furthermore, using the identity A • (B x (C x D)) = (A x B) • (C x D) we can 
check that 

VG • (VF x (VF x b)) = -(VF x b) • (VF x VG) . (2.5) 

Therefore the "friction-like" condition (2.3) becomes 

(VF x b) • (VF x VG) > . (2.6) 

The equation (1.2) corresponds to the case that h(Xf) = 1 \JG{Xf) and F(Xf) = 
Xf\ 2 . One easily checks that system (2.4) preserves F so that Xf is moving on a 
certain level surface {F = z}. 

We make some geometric assumptions that are used in Sections 3 and 4. Suppose 
that the set S(z) = {x £ R 3 : G(x) < G(x (z)) + 1} n {x £ R 3 : F(x) = z} is a 
2-dimensional Riemannian manifold which is G°°-diffeomorphic to R = {(a,b) 6 R 2 : 
a 2 + b 2 < 1}. Let the G°° diffeomorphism be / : S(z) — > R. To be specific, we denote 
/(xi,x 2 ,2;3) = {fi{xi,%2,x?,)-, f2{xi,X2-,xz)) for (xi,X2,x^) G S'(z). We assume that the 
diffeomorphism / is non-singular for (x\, X2, x%) £ S(z). We denote by d(», •) the metric 
on S(z) induced by standard Euclidean metric in R 3 . Let our function G on S(z) have 
only one saddle point and two minima, and these critical points are non-degenerate. 
Assume that the level surfaces {G = g} are transversal to the level surface {F = z\. 
VF(x) and VG(x) are not parallel. We denote by C(g, z) the set {G(x) = g} n {F(x) = 
z}. Without loss of generality we can assume that G(0, z) = {G = 0} n {F = z} is the 
oo-shaped curve (homoclinic trajectory) on {F = z} corresponding to the saddle point 
of G. Let the saddle point of G on {F = z} be 02(z) and the two minima be 0\{z) and 
03(21). Suppose that as z varies, the curves Oi(z), 02(z) and O^(z) are transversal to 
{F = z} (see Fig.l). Notice that when g > 0, C(g, z) has only one connected component 
which we call C2(g,z)- When g < 0, C(g,z) has two connected components C\(g,z) 
and C3(g,z) bounding domains on S(z) containing 0\(z) and 03(21) respectively. Let 
Gi(0, z) and 03(0, z) be the parts of the homoclinic trajectory O(0, 2) bounding domains 
containing 0\{z) and 03(21) respectively. Let 02(0, z) = O(0, z). Let Fj(g, z) {i = 1, 2, 3) 
be the region bounded by Ci(g, z). 

In the next two sections when we speak about a stochastic process or a motion on 
the surface S(z), for example Xf, X t ' etc. , we are assuming that they are stopped 
once they hit dS(z). 

To study equation (2.3), we make a time change t i-> -. Let Xf = X^,. We get 
from (2.2) that 




Oi(z) 



0,(z) 




L 

Os(z) 



Fig. 1: The Landau-Lifshitz dynamics 



X\ = -VF(X £ )xVG(X £ )+VF(X £ ) x (VF(X £ )xh(X £ )) , X £ Q = x G M 3 , < e « 1 . 

(2.7) 

Therefore the fast motion is defined by the vector field — VF x VG, and the slow 

e 
motion is due to X7F x (VF x b) (we will sometimes ignore the arguments since they 

could be directly understood from the context). In order to study the limiting behavior 

of the process X s , we introduce a graph T (compare with [14, Chapter 8]). The graph 

r is constructed in the following way. Let us identify the points of each connected 

component of the level sets of G on S(z). Let the identification mapping be 2). The set 

obtained after such an identification, equipped with the natural topology, is a graph T 

with an interior vertex C>2{z) corresponding to the saddle point 02(z) on S(z) and related 

homoclinic curve (in the following we will use the same symbol for either the critical 

point of G on S(z) or the corresponding vertex on T), and two exterior vertices 0\{z) and 

03(2;) corresponding to the stable equilibriums 0\(z) and 02{z) on S(z), together with 

another exterior vertex P corresponding to dS(z) (notice that by our definition S(z) = 

{x GM 3 : G(x) < G(x (z)) + 1} n {x G M 3 : F(x) = z} so that dS(z) is a level curve of 

G on {F = z}). The edges of the graph are defined as follows: edge I2 corresponds to 

trajectories on S(z) lying outside C(0, z); edges I\ and ^3 correspond to those trajectories 

on S(z) belonging to the wells containing Oi(z) and Os(z), respectively. A point 2)(x) = 

y £ r can be characterized by two coordinates (g, k) where g = G(x) is the value of 

function G at x G 2) _1 (y) C S(z), and k = k(x) is the number of the edge of the 

graph r to which y = 2)(x) belongs. Notice that k is not chosen in a unique way since 

for y = 02(z) the value of k can be either 1, 2 or 3. The distance p(yi,y2) between 

two points 2/1 = (G(xi),k) and y 2 = (G(x 2 ),k) is simply p(yi,y 2 ) = \G(xi) - G(x 2 )\. 



For 2/1,2/2 £ T belonging to different edges of the graph it is defined as p(yi,y 2 ) = 
p{ yi ,0 2 {z))+p(0 2 (z),y 2 ). 

The slow component of X s is the projection of X £ on T: Y t £ = %)(X £ ). Using the 
classical averaging principle one can describe the limiting motion of Y t £ as e \. inside the 
edges. But it turns out that the trajectory Yf, when hitting the interior vertex 2 (z) on 
r, is very sensitive to e. This means that Yf = Z)(X £ ), G(X £ ) > G(0 2 (z)), hits 2 (z) in 
a finite time £q such that lim tf, = to exists and finite, and after that alternatively as e \, 

e4-0 

goes to I\ or I3 . The limit of Yf as e \, for t > to does not exist (compare with [4] ) . In 
order to describe the limiting behavior, we have to regularize the problem. To this end 
one can add a small stochastic perturbation of order S either to the initial condition or 
to the equation. Let X t ' be the result of addition of such a perturbation. Then, under 
certain mild assumptions, the slow component %)(X £ ' ) of X £ ' converges weakly in the 
space of continuous trajectories on any finite time interval [0, T] to a stochastic process 
Yt on the graph T as first e I and then 5 ! 0. Since small random perturbations, as 
a rule, are available in the system, exactly this weak limit characterizes the behavior of 
X s , as < e « 1. We will introduce different types of regularization and prove that 
all these regularizations lead to the same limiting stochastic process Yt on T, which we 
calculate. 

The proofs, in Section 3 and, partly, in Section 4, are similar to the case of pertur- 
bations of Hamiltonian systems ([14, Chapter 8], [4]), and we pay most of the attention 
to the arguments which are not presented in these works. For instance, in the case of 
regularization by a random perturbation of the initial point, bounds for the hitting time 
of the homoclinic trajectory are considered in details. 

So far we considered just deterministic perturbations preserving the first integral 
F. Stochastic perturbations were used just for regularization of the problem. One can 
consider also white- noise- type perturbations preserving F of the same or of a larger order 
than deterministic perturbations. Then, in an appropriate time scale, the limiting slow 
motion converges to a diffusion process on a graph (Section 4). In general, deterministic 
and stochastic perturbations have different order, so that, after time rescaling t — > -, 

£ 

the perturbed equation has the form 

Xf = -VF(X £ t ' S )xVG(X £ /) + VF(X £ t ' S )xb(X £ t ' S ) + 5a(X £ t ' S )oWt ,X £ ' S = x . (2.8) 

Here Wt is the standard Gaussian white noise, cr(x) is a smooth matrix-function 
such that o X7F = 0. If we denote by a(x) = a(x)a T (x) the diffusion matrix, the 
condition a T (x)'VF(x) = is equivalent to the assumption that a(x)'VF(x) = 0. The 
stochastic term in (2.8) is understood in the Stratonovich sense, then F(X £ ' ) = F(xo) 
with probability 1. We assume that the matrix a is non-degenerate on {F = z}. (We 
will specify the non-degeneracy in Section 4.) 
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The process X £ ' defined by (2.8) lives on the surface {i £ I 3 : Fix) = z} and has 
a slow and a fast component as e « 1 and 5 > fixed. The slow component is again 
the projection %)(X £ ' ) of X^' on the graph T. We consider the case < £ « 5 « 1 
and assume that the deterministic perturbation is friction-like. 

If e > is small enough and 5 = 0, the system X s ' , X £ ' = x G S(z), has 

three critical points 0[(z), 0' 2 (z), 0' 3 (z) of the same type as the corresponding points 

Oi(z). The distance between corresponding points tends to zero together with e. If 

< 5 « 1, X £ ' 5 , X £ ' S = x, at a time t = T 5 (X), lim<S 2 lnT a (A) = A > 0, is situated 

<54_0 

in a small neighborhood of the metastable state M £ (x, A); M £ (x, A) is one of the stable 
equilibriums of X s ' . The function M e (x, A) is defined by the action functional for the 
family X £ ' S as 5 I (see [8], [10], [13], [14], [21]). 

But if e tends to zero, the situation is different: Xf,' 4 ,.., Xq = xq, converges to a 
random variable distributed between 0\{z) and 0$(z) as < e « 8 « 1. The set of 
possible distributions between the minima is finite and is independent of the stochastic 
part of perturbations. But which of these distributions is realized at a time T (A) 
depends on A and xq = Xq , as well as on stochastic perturbations. We describe these 
metastable distributions in Section 4. 



Perturbations of a more general equation than (2.1) are considered in Section 5. 
The non-perturbed motion in this case, in general, has just one smooth first integral and 
the averaging procedure essentially depends on the topological structure of the connected 
components of level sets of the existing first integral. Each connected component is two 
dimensional orientable compact manifold. The topology of such a manifold is determined 
by its genus. We show that if the genus is greater than zero (for instance, when the 
component is a 2-torus T 2 ), the limiting slow motion spends an exponentially distributed 
random time at some vertices. 

Perturbations of system (2.1) may have different origin and they may have different 
order. In the last Section 6, we briefly consider such a situation. 

Perturbations of (2.1) breaking both first integrals F{x) and G{x) can be consid- 
ered: (after time change) 

XI = -VF{X £ t ) x VG{X £ t ) + B(X £ ) , X £ = x (z) G K 3 , < e « 1 . (2.9) 

Here B(») is a general smooth vector field on R 3 . Then the perturbed motion is not 
restricted to the level surface {F = z}. In this case the slow component of the perturbed 
motion lives on an " open book" n homeomorphic to the set of connected components of 
the level sets C(z±, 22) = {x G IR 3 : F(x) = z±, G{x) = z 2 } , (zi, z 2 ) G M 2 (compare with 
[16]). The slow component of the motion is equal to %)(X £ ) = Y £ , where 2) : M 3 — > n is 



the identification mapping. After an appropriate reguiarization, Yf approaches as e \, 
a stochastic process Yj on n. We will consider this question in more details elsewhere. 

3 Reguiarization by perturbation of the initial condition 

We study in this section the reguiarization of system (2.7) by a stochastic pertur- 
bation of the initial condition. 

Let Ug(x) = {y G S(z) : d(x,y) < 5}. 
Consider the equation: 



Xf = -VF(X £ ' 5 )xVG{Xf s )+VF(X £ ' 5 )x(VF(X £ ' 5 )xb(X £ ' 5 )),X £ ' 5 = x (z,8) G M 3 . 

(3.1) 

Here < 5 « 1 is a small parameter. The initial position xq(z, 8) = X s ' is a 
random variable distributed uniformly in U$(xo(z)) C {F = z}. We are choosing 5 small 
enough so that U$(xo(z)) C S(z). 

Our goal is to prove the following 

Theorem 3.1. Let X £ ' be the solution of equation (3.1), and Y"/' = %)(X £ ' ) be 
the slow component of X t ' . Then, for each T > 0, Y t ' converges weakly in the space 
of continuous functions f: [0, T] — >■ T to a stochastic process Yt(xo(z)) as, first, e 1 
and then 8 1 0. 

We will define the process Yt(xo(z)) later in this section. 

Let us start with the perturbed, but not regularized system (2.7). The motion of 
Xf is on the surface S(z). The change of G{Xf) is governed by the equation 

dG ( X ^ =VG-(-VFx VG + VFx (VFxb)) 
at e 

= VG • (VF x (VF x b)) 

= -(VF x b) • (VF x VG) . 

The function G is a first integral of the unperturbed system (2.1) and the damping 
term VF x (VF x b) of (2.7) plays the role of "friction" which makes the value of G 
smaller and smaller. 

The stable, but not asymptotically stable equilibriums 0\(z) and 0%(z) of (2.1) 
become asymptotically stable equilibriums 0[(z) and 0' 3 (z) for the perturbed system 
(2.7). The saddle point 02(z) becomes the saddle point 0' 2 {z). The distances between 
0\(z) (C-2(z), Os(z)) and 0[(z) (0' 2 (z) , 0' 3 (z)) are less than Ae for a constant A > 0. 
When e is small enough, the pieces of the curves formed by 0[(z), 0' 2 (z) and 0' 3 (z) (as 




Fig. 2: White and grey ribbons 

z varies) are transversal to {F = z}. Separatrices of the saddle point 0' 2 (z) are shown in 
Fig.2. They, roughly speaking, divide the part of the surface S(z) outside the co-shaped 
curve C(0, z) in ribbons: the gray ribbon enters the neighborhood of 0' 1 (z), and the 
white ribbon enters the neighborhood of 0' 3 (z). The width of each ribbon is of order e 
as e \. 0. 

The trajectory Xf has a fast component, which is close to the non-perturbed motion 
(2.1) (with the speed of order — ), and the slow component, which is the projection 
Yf = %)(Xt) of Xf on the graph T corresponding to G{x). Within each edge of the 
graph, say edge Ii, i = 1,2, 3, standard averaging principle works. Let G\ = G(Xf). We 
have, by the standard averaging principle (cf. [1], Ch.10), 



lim sup 

=4-0 0<t<T<oo 



IW 



G t \=0 



The function Gt satisfies Go = G{x${z)) and 

*tp.= B®(G t ) , where 
at 




VG-(VFx (VFxb))- 



c iig , z ) |VF x VG\ , , 

,„„ , , VFxVG „ ^- 2j 

- (VF x b) • d/ 

9(9,2) IV* x VG| 

(VF x b) • vdl 

9.(9,2) 

. . V x (VF x b) • ndm . 

-Tj(S') J JDi(g,z) 

r a] 

Here Tj(g) = ® — is the period of rotation for the unperturbed sys- 

JCi(g,z) l VF x VG| 

tern (2.1) along the curve Ci(g,z). The vector v = — is the unit velocity 

VF(x) 

vector for the unperturbed system (2.1); n = n(x) = is normal to the level 

surface {F = z}. The area element on {F = z} is denoted by dm. We used the Stokes 
formula in the last step. 

Fix a point x${z) on the level surface {F = z} outside the oo - shaped curve 
C(0,z). To be specific, let xq(z) belong to the white ribbon. Let J s (z) be the curve 
on {F = z} containing xq{z) and orthogonal to the perturbed trajectories (2.7). Let 
a(z), b(z),c(z) be the intersection points of r y s {z) with separatrices neighboring to xq(z). 
To be specific, let x${z) lie between b(z) and c(z) (see Fig. 3, where a part of the flow is 
shown). By our transversality condition, we can take A > small enough and a curve 
£(z) , z £ [z — 2A, z + 2A] which lies on the surface {G = G(xq(z))} and is transversal to 
the level surface {F = z}, containing the point xq(z) (£(z) = xo(z)). Let xo(z) = £(z). 
Consider the curve "f s (z) on {F = z} containing xq(z) and orthogonal to the trajectories 
of (2.7). We also consider corresponding neighboring points a(z), b(z), c(z) defined for 
xq{z) in the same way as we did for xq{z). For fixed e > 0, we choose A small enough 
such that as z varies in [z — 2A, z + 2A], the curves a(z), b(z) and c(z) are transversal 
to {F = z}. The part of 7 s (2 r ) between a(z) (b(z)) and 6(5) (c(z)) belongs to the grey 
(white) ribbon for the trajectories of (2.7) on {F = z}. Now we consider the curvilinear 
rectangle Di with vertices a(z + A) , a(z — A) , b(z — A) , b{z + A) constructed in the 
following way: Di consists of the parts of the curves of 7 s (z) from a(z) to b(z) as z 
varies in [z — A, z + A]. We construct another curvilinear rectangle D2 with vertices 
b(z + A) , b(z — A) , c(z — A) , c(z + A) in exactly the same way as Di, but consisting 
of curves 7 s (i) from b(z) to c(z) as z varies in [z — A, z + A]. 

Let vector v be the unit vector outward normal to these two curvilinear rectangles 
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Fig. 3: Transversality 

□i and IH2, pointing in the direction opposite to the perturbed flow (2.7). By the 
divergency theorem in M 3 we see that, for k = 1,2, 



- VF xVG + VFx (VF x b) 



vdm 



div(VF 

z+\ 

dz 

A 



(VF x h))dV 



Jz-\ JJs 



-2A 



SfcO) 



S k (z) 

Vx (VF 



VF 
V x (VF x b) • 7=;dm 



(3.3) 



VF 



b) 



VF 

IVFI 



dm + o(A) 



We have used here the formula div(A x B) = B • (V x A) — A • (V x B). The 
regions S^ and Skiz) (k = 1,2) are defined as follows: E^ is the 3-dimensional region 
filled by trajectories of (2.7) starting from D^ and belonging to the family of level 
surfaces {F = z}, zG [z — A, z + A], k = 1,2; 5fc(F) is the 2-dimensional region filled 
by trajectories of (2.7) starting from D^ n {F = z} and restricted to the family of level 
surfaces {F = z}, k = 1,2, z G [z — A, 2 + A]. Notice that the boundary of the compact 
set £k consist of D^ and a surface formed by the perturbed trajectory. For notational 
convenience the area element on □& (k = 1,2) is denoted also by dm. 

Let L(a(z),b(z)) and L(b(z),c(z)) be, respectively, the arc length of r ) s {z) between 
a(z) andfr(z), and between b(z) andc(z). The flux of the vector field -VF x VG + VF x (VF x b) 
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through Dj. (k = 1, 2) is equal to — / / 



VFxVG + VFx (VF x b; 



dm. 



Let Area(«) denote the area of some domain. Let \J(z, r y s (z))\ / be the Jacobian 
factor between the area element on Di U D 2 and dzd^ s (z). We have 

Area(pi U D 2 ) 

2+A rc(z) 

dz / _ \J(z,-f s (z))\d^ s (z) 

-A J a(z) 

= 2A f ' \J(z, ls (z))\d ls (z) + (I) 

J a(z) 

= 2\\J(z,b(z))\L(a(z),c(z)) +2\(II) + (I) . 
Here 



(I) 



and 



,-+A / rc(z) fc(z) 

dzl / \J(z,j s (z))\dj s (z) - / \J(z,-/ s (z))\d-/ s (z) 

-A \Ja(z) Ja(z) 



(II) = f CZ (J(z, ls (z)) - J(z,b(z)))d ls (z) . 

J a(z) 



Note that \(I)\ < C\X 2 since the function I(z) = / \J(z,j s (z))\d'y s (z) satisfies 

J a(z) 

\I(z\) — 7(^2)1 < C2\z\ — Z2\- We also have \(II)\ < C^L(a(z), c(z)) 2 since \J(z,j s (z)) — 
J(z,b(z))\ < Ci\'~f s (z) — b(z)\ < C§L{a{z) , c(z)) . Combining these estimates with (3.3) 
and the fact that for some constants Cq, Cj > 0, 



C 6 < 1 

e ~ Area(Di U D 2 ) J J Dl un 2 



VF x VG + VF x (VF x b) 



dm < 



C 7 



we see that as e 1 0, the asymptotic widths of the grey and white ribbons (i.e. L(a(z),b(z)) 
and L(b(z),c(z))) are of order 0(e). The next lemma gives the asymptotic ratio of the 
widths: 

Lemma 3.1. Let xq(z) and the points a(z) , b(z) , c(z) be defined as above. Then 



L(a(z),b(z)) _ JJd^z) 



e|0 L(b(z),c(z)) 



V x (VF x b) ■ ndm 



(3.4) 



V x (VF x b) ■ ndm 
'D 3 (0,z) 

Here the domains D±(0,z) and D^(0,z) are the regions bounded by Ci(0, z) and 
C 3 (0,z). 
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The proof of the lemma is similar to the proof of Lemma 3.4 in [4] but based on 
(3.3), rather than on the divergency theorem in ]R 2 , as in [4]. We provide the details in 
the Appendix. 1. □ 

In the following we will fix an initial point x (not necessarily xq{z)) on S(z). We 
put x = f(x) £ f(S(z)) = R = {(a,b) £ E 2 : a 2 + b 2 < 1}. Let us consider the 
trajectory Xf(x) of (2.7) starting from point x. Let Xfix) = f(Xf{x)). Our goal now is 
to estimate the time of "one rotation" of Xf(x) around either 0[(z) or 0' 3 (z) or around 
both of them. 

Note that (in two dimensional case), a neighborhood U of a saddle point of G on 
S(z) exists such that the system can be reduced to a linear one in U C M 2 by a non- 
singular diffeomorphism of the class C 1,a , a > 0. This comes from the corresponding 
result in R 2 ([17, Theorem 7.1]) and the fact that our surface S(z) is C^-diffeomorphic 
to R = {(a, b) G R 2 : a 2 + b 2 < 1}. 

In our case, the system depends on a parameter e, but one can check that neigh- 
borhood U and a > can be chosen the same for all small enough e, and the C 1,a -norm 
of the functions defining the diffeomorphism are bounded uniformly in e. 

For the reason above, it is sufficient to consider the corresponding flow Xfix) on 
R. Such a flow has the same structure consisting of grey and white ribbons on R. For 
notational convenience we will use the same symbols for objects related to such a flow, 
corresponding to our original Xf{x). For example, we will write Xf(x) simply as Xf{x), 
and the set f{U$(x)) as U$(x), etc. . The reader could easily understand which specific 
flow we are referring to from the context. 

The system on R can be linearized in a neighborhood of 0' 2 (z), as described above. 

First, note that if x is situated outside a fixed (independent of e) neighborhood of 
the oo-shaped curve C(0, z), the trajectory Xf(x) comes back to corresponding curve 
7 3 x, orthogonal to the perturbed trajectory , at least, if e > is small enough. 
The time of such a rotation t e (x) < eA(x) (recall that we made time change t —> -); 
A(x) here is independent of e and bounded uniformly in each compact set disjoint with 
C(0,z). 

If x is close to C(0,z), then Xf(x) comes to a ^-neighborhood Us(0' 2 (z)) of 0' 2 (z) 
in a time less than e^, Ag < oo. But the time spent by the trajectory inside the 
neighborhood Us(0' 2 (z)) of 0' 2 (z) can be large for small e; in particular, the separatrices 
entering 0' 2 (z) never leave U§(0' 2 (z)). So we should consider trajectories started at 
distance S from 0' 2 (z) in more detail. 

Let 5 > be so small that U 2 $ (0' 2 (z)), for e small enough, belongs to the neighbor- 
hood U of 0' 2 (z) where our perturbed system can be linearized. The saddle point 0' 2 {z) 
under this transformation goes to the origin O, the separatrices of 0' 2 {z) go to the axis 
x and y, the trajectories Xf go to the trajectories of the linear system (Fig.4). 
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Fig. 4: Linearized system 

One can explicitly calculate the time 9(h, 5) which the linear system trajectory 
needs to go from a point (h,yo) to (S,yi) (Fig.4): 



9(h, S) = const 



mi 

5 



(3.5) 



Let a perturbed trajectory enters U$(0 2 (z)) at a point x G dU$(0' 2 (z)), G{x) > 0, 
and exits U§(0' 2 (z)) at a point y G dU§(0' 2 (z)). We can assume that x and y are close 
enough to the pieces of the separatrices which go to the axises x, y after the linearization 
so that the curves 7 and 7' orthogonal to perturbed trajectories and containing x and 
y respectively cross these pieces of separatrices (these pieces are shown in Fig. 5 as bold 
lines and denoted by numbers 1,2,3,4) at points a and a' (Fig. 5). Let the distance 
between x and the closest last piece of the separatrix entering 0' 2 (z) be equal to h (here 
and below we are using the distance defined by minimal geodesies since we are working 
in a sufficiently small neighborhood). Consider the closest to x separatrix crossing 7 at 
a point b such that G(b) > G(x). Let I be the distance between y and this separatrix. 

If at least one whole ribbon intersects the curve 7 between x and the piece of 
the separatrix entering 2 {z) (and containing point a), the trajectory Xf(x) makes a 
complete rotation around both 0[(z) and 0' 3 (z) and crosses 7 at a point x' £ 7 (case 
1). The time spent by this trajectory outside U$(0 2 (z)) is bounded from above by A\£. 
Since the perturbed system can be linearized in U2s{0 2 (z)) by a C^-diffeomorphism, 
equality (3.5) implies that the transition from x to y takes time less than A2£\ lnh\; A\ 
and A2, in particular, depend on 5, but are independent of e. 

The trajectory Xf(x) comes to dUs(0' 2 (z)) again at the point z (Fig. 5). It follows 
from the divergence theorem that the distance from z to the last piece of the separatrix 
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Fig. 5: Case 1 

entering 0' 2 (z) (and containing the point v in Fig. 5), in the case when Xf(x) comes 
back to x' £ 7, is bounded from below and from above by A%h and A±h respectively. 
Therefore the transition from z to z' also takes time less than A$e\ ln/i|. 

Consider now the case when between the initial point y £ dU$(0' 2 (z)) and the 
last piece of the separatrix entering 0' 2 (z) there is no whole ribbon (Fig. 6). Transition 
between y and y' , because of the same reasons as above, takes time less than Aqe\ ln/i|, 
where h is distance between y and the last piece of separatrix entering 0' 2 (z). But 
complete rotation of the trajectory Xf{y) includes also the transition from z to y" . It 
is easy to check using divergence theorem that, the distance from z to the separatrix 
entering 0' 2 (z) is bounded from below and from above by Ajl and A&l respectively, 
where I is the distance between y and the separatrix crossing 7 at a point b such that 
H(b) > H(y) (Fig. 6). Therefore, the transition time between z and y" is less than 
j4ge|lnZ|, and the whole rotation time for Xf{y) is less than Aioe(|ln/i| + | In Z| ) for 
£ > small enough. 

Denote by t £ {x) the time of complete rotation for the trajectory Xf(x). Suppose x 
is not a critical point of G. We have 



te{x) 



min{£ > : Xfix) crosses twice one of the curves 7 or 7'} . 



Summarizing the above bounds and taking into account that outside Us(0±(z)) U 
Ug(0 2 (z)) U Ug(0' 3 (z)) the trajectory Xf(x) moves with the speed of order e _1 , we get, 
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Fig. 6: Case 2 

Lemma 3.2. Let Xf(x) enters Us(0' 2 (z)) at a point y = y{x) G dUs(0 2 (z)), and 
let h = h(x) be the distance between y(x) and the last piece of a separatrix entering 
0' 2 (z). Let 7 be the curve orthogonal to perturbed trajectories and containing y(x). 

If in one complete rotation, Xf(y{x)) come back to 7, then 



t £ (x) < Ane\ lnh(x) 



(3.6) 



If Xf{y(x)) does not come back to 7, and l{x) is the distance from y(x) to the 
closest separatrix, which crosses 7 at a point b, such that G(b) > G{y{x)), then for 
e > small enough, 



t £ (x) < Ai 2 e(|ln/i(x)| + |ln/(x)|). 



(3.7) 



Now we come back to our original system (2.7) on S(z). Let a be a small positive 
number. Denote by £ a = £ a {e) the set of points x £ S(z) such that the distance between 
x and the closest separatrix is greater than set (since e is small we can work with minimal 
geodesies). Let £& be the intersection of £ a with the gray ribbon; £™ be the intersection 
with the white ribbon. 

Denote by A £ (x,(3) the time when Xf{x) reaches C(/3,z): 

A £ (x,/3)=inf{t>0:G(Xf(x)) = /3}; 
if G{x) > and |/?| is small, A £ (x,f3) < 00 for all small e > 0. 
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Lemma 3.3. Let G(xq(z)) > and let \x > be so small that G(x) > for 
x £ U2^,(xq(z)) . There exist ao, A) > and A13 such that for each x G U^(xq(z)) D £ a , 
aG (0,a ), p€ (0,/3 ), 

A £ (x,-/3)-A £ (x,/3)<^ 13 /3|ln/3| (3.8) 

/or e < e$. Here A\%, in particular, depends on a and (3 but is independent of e; £0 > 
depends on a and (3. 

The proof of this lemma is based on Lemma 3.2 and the fact that each rotation 
decreases the value of G on an amount of order 0(e). Therefore the total time is less 

than A14 ^ e| ln(fe)| ~ / | lnzjtiz < A\^I3\ ln/3| for e > small enough. □ 
k=i Jo 

Proof of Theorem 3.1. Equation (3.2) can be considered for each of three edges of 
the graph Y corresponding to G(x) on S(z): for i = 1,2,3, we have 



dl (3.10) 



9i 

' i i\yt 



^{9?) 



S 2 ) 



Tl{9) T Ci{g , z) |VF x VG| 

B®(g, z) = - J J V x (VF x b) • ndm . 

Equation (3.10) for i = 1 can be solved for each initial condition g y ^' — g > 0, 

(2) 
g < max{G(u)) : w G dS(z)}. Such a solution is unique, and g\ reaches in a finite 

time To(g, z). If i = 1, 3, equation (3.10) with initial condition g = g < has a unique 

solution; if g^' = 0, equation (3.10) has a unique solution <? t if we additionally assume 

that g? < for t > 0. 

Define two continuous functions g}(g) and gf(g), t > 0, as follows: g$ =do = g > 0, 

-1/ n \ 9t > % = 5 , < t < r (#, 2) , 
& (5) = < ~(i) . . 

{ 9\. My , z) ,Mg,z)<t<oo; 

-3/ x f % » 30 = 3 , < £ < To(#, z) , 

St (3) = < ~(3) , v . , ^ 

Let us cut out ole- neighborhoods of the separatrices (//- neighborhood of a point xq, 
G{xq) > 0, is shown in Fig.7); recall that £ a is the exterior of the ea-neighborhood of 
the separatrices, £| is the intersection of £ a with the gray ribbon, £™ is the intersection 
of £ a with the white ribbon. In particular, £q (£q) is whole gray (white). 
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Fig. 7. 



The classical averaging principle together with Lemma 3.3 imply that for each 
x G Us(xq(z)) H £a, G(x) = g > 0, for any A,T > 0, and any small enough a, 5 > 0, 
there exists Eq > such that 



TUK\H(XZ(x))-g}(g)\<\ 

for < e < eq. 

Similarly, for each x € E/,$(a;o(z)) n £™, G{x) = g > 0, 



(3.11) 



o maxJtf(Xf(aO)-3f(<7)l<A 



(3.12) 



for < e < eo- 
Let G(x) > for x £ Ug(xo(z)) so that %)(Ug(xo(z))) C h C T. Define a stochastic 
process Y t "(xo(z)), t > 0, on T as follows: (recall that the pair (k,g), where k is the 
number of an edge, k G {1,2,3}, and g is the value of G(x) on 2) _1 (y), y G T, form a 
global coordinate system on T) 

y/(x (z)) = (2,^(G , (x (z,<5)))) for < t < t (x (z,S)). 

(2) (2) 

(Recall that tq(xq(z, 5)) is the first time when the process g t , <7 G(xq(z,8)) > 
in (3.10) reaches 0.) 

At the time To(xq(z, 6)) the process Y t (xq(z,S)) reaches 02(z) and without any 
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delay goes to I± or 1$ with probabilities 



Pi 



Dt(p,z) 



V x (VF x b) ■ ndm 



(3.13) 



V x (VF x b) ■ ndm + V x (VF x b) ■ ndm 

Di(0,z) JJd 3 (o,z) 



P3 



D 3 (0,z) 



V x (VF x b) ■ ndm 



(3.14) 



V x (VF x b) ■ ndm + V x (VF x b) ■ ndm 

'Di(0,z) JJd 3 (0,z) 

respectively; y/(x (z)) = (l,?t_^( xo ( Z!< j))(a;o(z,<$)) for t (x (z,S)) < t < 00 if r/(x (z)) 
enters F at time t (x (z, 5)), andY t 5 (x (z)) = {^,9^. T0 ^ xo ^ 5 ))( x o{z,S)) (01 t (x (z, 5)) < 
t < 00 if Y t °(xo(z, 6)) enters ^3 at time to(xq(z, /j,)). 

One can consider a process Y t (xo(z)) = Y t °(xo(z)) on F: Y t (xo(z)) is deterministic 
inside the edges and governed by equations (3.10); its stochasticity concentrated at the 
vertex 02(z): after reaching 02(z), Y t (xo(z)) immediately goes to I\ or to I3 with 
probabilities p\ or p% defined by equalities (3.13) and (3.14). 

Denote by Area(£>), D C S(z), the area of a domain D. Since the point xq{z, 5) is 
distributed uniformly in Ug(xo(z)), 



P{X £ t ' 5 enters Di(0,z)} 



Aiea(£ln U 5 (x (z))) 



,e,S 



P{Xp° enters D 3 {0,z)} 
as e \, 0. According to Lemma 3.2, 



Are&(U s (x (z))) 
Area^ riU s (x (z))) 



0. 



(3.15) 



Area(Us(x (z))) 



lim 



Area(£lr\U^x )) 



pi , lim 

eJX) 



Area(fy D t^(xo)) 



P3 



(3.16) 



Area(£/" M (x )) ^ ' 'slo Area(£/" M (x )) 

where j»i and ^3 are defined in (3.13) and (3.14). 

Taking into account that Area(£anUs(xo(z))) — > Area(£QnUs(xo(z))) and Area(£™ n 
Ug(xo(z))) — > Area(£g" n Us(xo(z))) as a 1 0, we derive from (3.10)-(3.16) that, for each 
T > 0, the slow component 2)(-X^' ) of X^' converges weakly in the space of continuous 
functions on [0, T] with values in T to the process Y t °(xo(z)). 

It is easy to see that Y t °(xq(z)) converges weakly to Y t (xo(z)) as S I 0. 

This gives the proof of Theorem 3.1. □ 
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4 Regular izat ion by stochastic perturbation of the dynam- 
ics 

Let now the perturbation have deterministic and stochastic parts: 



Xf = -(VF x VG){xf) + VF x (VF x h)(X £ t ' S ) + Sa(X £ ' S ) o W t , (4.1) 

Xq = xo(z) G R 3 , F(xo(z)) = z, and 5 > 0, < e << 1. The stochastic term 
o~(X 6 ' ) o Wt is understood in the Stratonovich sense. The 3x3 matrix o~(x) = {aij{x)) 
is assumed to be smooth and satisfy the relation a T X7F = 0. If we denote by a(x) = 
(ciij(x)) = o~(x)o~ T (x) the diffusion matrix, the condition a T (x)X7F(x) = is equivalent 
to the assumption that a(x)VF(x) = 0. By using the ltd formula for Stratonovich 
integrals, we have, that 



dF(xf) =VF 
dt 



-(VF x VG)(X £ /) + VF x (VF x h)(X £ /) + 5a(X £ /) o W t 







(One can directly check that equality c T VF = implies VF-croW^ = .) In particular, 
if b(x) = 0, we have pure stochastic perturbations. Therefore F is a first integral 
for system (4.1), i.e., the process X t ' never leaves the surface {F = z}. We also 
assume that e • (a(x)e) > a|e| 2 for a constant a > and every e G M 3 such that 
e • VF(x) = 0. This means that the process X £ ' is non-degenerate if considered on the 
manifold S(z) C {i G I 3 : F(x) = z}. Recall that we stop our process X s ' once it hits 
dS(z). The resulting process is still called X s ' . 

We will make use of the following simple Lemma (see, for instance, [20, page 36, 
formula (3.3.6)]): 

Lemma 4.1. Let b(s, x) : JR+ x R — > R be Lipschitz and bounded in s and x. Let 
a(s,x) : R+ x M. d — > M. d x R d be bounded, Lipschitz in s, and differentiable in x. Let o~ij 
is the (i,j)-th element of matrix a. Consider the diffusion process 

X t = x+ I b(a, X s )ds + / a(s, X s ) o dW s 
Jo Jo 

in M. d , where the stochastic term is understood in Stratonovich sense. Then we have 

X t = x+ I b(s,X s )ds + - / c(s,X s )ds+ / a(s,X s )dW s 
Jo * Jo Jo 

where the stochastic term is understood in the Ltd sense. Here vector c(s, x) G R rf has 

d q g .. 
i-th component Yl a %J a kj , 1 < i < d. 

j',fe=l ® x k 
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Using this Lemma, we easily write equation (4.1) in the ltd sense: 
Xf = -(VF x VG){xf) + VF x (VF x b)(X £ t ' S ) + 5<y{xf)W t + —V{X £ t > 6 ) . (4.2) 

„ ^ dijij 

Here SI is a vector in M. with the i-th component Sj = 22 a ^"fcj f° r ^ = 1,2, 3. 

The generator L of the process Xp is written as 



Lu( x ) = _ V" a i7 -(x) „ - + -VF x VG + VF x (VF x b) + — S • V«(i) . 
2 ^— ' oxiOXj \e 2 / 

(4.3) 

Using Ito's formula we see that 



G(X^) - G(x (*)) = 5 f\vCf) T (X e /)a(X e /)dW 8 + 

Jo 

+ / VG-(VFx(VFxb)) + -VG-£ + - V 
- 70 V 2 2 . , =i 



-^ I w>*- 



(4.4) 

Now we are in a position to use the standard averaging principle (see, for example, 
[14, Chapter 8]), to check that within edge Ii (i = 1, 2, 3) of the graph r, as e 4- and S is 
fixed, the process G(Xp ) converges weakly to the process G* governed by the operator 

-T 1 ( A , X ^ A , N ^ A , C\ d ^ 1 T>< A ^ 

Lj = T^r (5 ' z)+ ^ M(5 ' z)+ ^^ (5 ' z) J^ + TF^^ (5 ' 2) ^ • (45) 

The coefficients are 

M 9 , z) = £ m VG • (VF x (VF x b))^^ , 

A ^ = L, VG '*W^VG\> 

a < ^ I ^ d2 ° dl (46) 

JCiig.z)^ J dxid Xj \VFxVG\ 

Here Ti{g) is the period of rotation of the unperturbed system (2.1) on Ci(g,z): 
r ai 

Ti(g) = f — , where dl is the length element on Ci(g, z). 

JCi{g,z) |VF X VG| 

We define a process Y t on T as follows: Y t is a Markov process on F, stopped once it 
hits exterior vertex P (recall that we stop our process X%' once it hits dS(z); also recall 
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that by our definition S(z) = {x £R 3 : G{x) < G(x (z)) + l}n{i£R 3 : F(x) = z} so 

that dS(z) is a trajectory, corresponding to vertex P on V) and governed by a generator 

A. The operator A is defined as follows. The domain of definition for the generator 

A consists of functions f(g,k) on T which are twice continuously differentiable in the 

variable g within the interior part of each edge If, inside Ii, Af(g,i) = Lif(g,i), and 

finite limits lim Af{y) (which are taken as the value of Af at vertex Oi(z)) and finite 
y-+Oi{z) 

df df 

one sided limits lim — — (q,i), lim -rr-(q,i) exist. We set lim Af(y) = (taken 

as the value of Af at point P, this means that the process Y t is stopped at the point 
P). For the interior vertex 02(z), f satisfies the gluing condition: 

V(±)/?2i lim ^(j,t)=0, (4.7) 

where + sign is for the limit taking within edge I2 and — sign is for the limit taking 
within edge I\ and I3. The coefficients ^2,1 are defined by 

^'L 1 " 01 '^' (4 - 8) 

Exterior vertex 0±(z) and 0%{z) are inaccessible. Such a process Y t ° on F exists and is 
unique ([14, Chapter 8]). 

Theorem 4.1. As e \. and 5 is fixed, the process 2)(-X^' ) converges weakly in 
the space of continuous functions f : [0, T] — > F, < T < 00, to the process Y t s . 

The proof of this Theorem is based on the fact that we can carry the dynamics of 
(3.1) on S(z) to a corresponding one on R C M? by the C°°-diffeomorphism / : S(z) —> 
R 2 . We denote f(x 1 ,X2,x 3 ) = (fi(x 1 ,x 2 ,x 3 ), f 2 (x 1 ,X2,x 3 )) , (xi,x 2 ,x 3 ) £ S(z). Let 
Z}' = f(Xt' ) be the image of the diffusion process on M 2 . Using the ltd formula for 
Stratonovich integrals, we have 

7 e,S 



dz; 



df(xf) 

{Df){f-\zf))dxf 
V^'VmW) dt + 5{Df){f-\Z e /))a{f-\Z e /))odW t 



£ 

-(3(Z^ d ) +pi{Z £ t A ) ) dt + Sa(Zf>°) odW, 



1 ■' ~'5\ , O /r 7 e,8s\ ,. , z-fryej-, 



so that Z}' = (/i(.X"|' ), f2(Xf' )) is a diffusion process on M 2 , stopped once it hits dR. 
Here the matrix Df is the differential of /: Df = I — — - ) . The vec- 

\OXj J l<i<2,l<j<3 

tor fields 0(Z) = (Df)(VFx VG)(f- 1 (Z)) and /3 X (Z) = (Df)(VFx(VFxb))(f-\Z)). 
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The 2x2 matrix? is defined in the following way: a(Z)odW t = (D f)(f- l (Z))a(f- l (Z))o 
dWt, where Wj is the standard 3-dimensional Wiener process and Wt is the standard 
2-dimensional Wiener process. The integral curves of the vector field (3 has one saddle 
point /((^(z)) an d two stable equilibriums f{0\(z)) and f(0^{z)). 

We define Q{Z) = G{f- 1 {Z)) for Z G M. 2 . The function Q serves as the first 
integral for the vector field /3: VQ ■ (3 = 0. Furthermore, it is easy to check that 
j3{Z) = k(Z)VQ(Z) with n / 0, so that our system just by a non-singular time change 
differs from a Hamiltonian system with one degree of freedom. Therefore one can use 
the same arguments as in the case of 2-dimensional Hamiltonian systems (see, [14, 
Chapter 8], [13], [11]) to calculate the limiting behavior the process Zf as e \, 0. (In 
the calculation of the gluing conditions, the problem caused by additional drift term (3\ 
and another drift term related to the Stratonovich integral can be resolved using the 
absolute continuous transformation; detailed estimates see [11] and Appendix. 2.) The 
coefficients of the gluing condition at the interior vertex are given as follows: 

&,«=/ l(VS) T 5f§, 

Jf(Ci(o,z)) \/3\ 

where dl z is the length element on /(Q(0, z)). Note that they coincide with (4.8), since 
equality 

(VQfa o dW t = {VG) T (Df- l ){Df)a o dW t = {VG) T a o dW t 

implies 

\(Vg) T a\ 2 = \(VG) T a\ 2 , 

dl z dl 

and S = ^W' D 

The next step is to consider the limit as 5 ], of the process Y t . This follows the 
same line of argument as in [4, Section 2]. In particular, one can do a similar calculation 
as in Lemma 2.2 of [4]. The additional small drift term depending on 5 (caused by 
the Stratonovich integral) in (4.5) will disappear as 5 ! 0. (We briefly indicate how to 
calculate this in Appendix. 3.) We therefore have a limiting process Yt on T defined as 
follows: Yt = (g t f , kt) is a deterministic motion inside each edge of T with gf satisfying 
the differential equation (3.10) and the branching probability for Yt at vertex 0<i{z) is 
given by (3.13) and (3.14). The process Yj spends time zero at the vertex Oi{z\ These 
arguments imply 

Theorem 4.2. As S 1 0, the process Y t converges weakly in the space of continuous 
functions f : [0, T] — > T, < T < oo ; to the process Y t . 
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Theorem 4.1 and 4.2 imply that the slow component 2)(Jf|' ) of the process X^' 
converges weakly to the process Yj on the graph T. Note that Yj is independent of 
the diffusion matrix a(x) = a(x)a T (x) and is the same process which we had using 
regularization by stochastic perturbation of the initial point. 

Consider process X^' defined by (4.1). Under the assumption that the determinis- 
tic perturbation in (4.1) is friction-like, for e > small enough and fixed, the equilibrium 
0[(z) and 0' 3 (z) are asymptotically stable for the dynamical system Xp on {F(x) = z). 
The process X\' is close to X%' on any fixed time interval if 5 is small enough. But on 

time intervals of order exp < —r > for A > 0, X%' may perform transitions between the 

neighborhoods of 0[(z) and 0' 3 (z) due to the large deviations from X^' . In a generic 
case, for x G {F(x) = z] and A > 0, there exists just one stable equilibrium M £ (x, A) 
(in the case of two stable equilibriums, M £ (x,X) = 0[(z) or M £ (x,X) = 0' 3 (z)) such 
that with probability close to 1 as S \, 0, X^ S( . is situated in a small neighborhood of 

M £ (x, A), if Xft = x, lim<5 2 InT (A) = A. The state M £ (x, A) is called metastable state 
u 54.0 

for a given initial point x and time scale A > (see [8], [10] where the procedure for 
calculating M £ (x,\) is described). 

But it turns out that the function M £ {x, A) is very sensitive to e as e \. 0: For A 
not very large, M £ (x, A) alternatively is equal to 0[(z) or to 0' 3 (z) as e \. 0. Moreover, 
for small e, M £ (x,X) is sensitive to changes of the initial point x as well. Therefore, 
if e << 1, the notion of metastability should be modified (compare with [3], [9]): For 
given x and A, one should consider the set of metastable distributions between the stable 
equilibriums. In general, there exists a finite number of distributions on the set of stable 
equilibriums which serve as limiting distributions of X^ g ,.. as first e ], and then 5 \, 0. 
The set of metastable distributions is independent of the stochastic terms in (4.1) and 
defined just by the deterministic system and deterministic perturbations. But which of 



rE,8 V£,S 



those distributions serves as limiting distribution of X' S( +, X ' = x, is defined by the 
stochastic term in (4.1). 

In our case, when we have just two stable equilibriums 0±(z) and 0^,{z), three 
distributions can serve as metastable distribution: first, the distribution concentrated 
at Oi(z), second, the distribution concentrated at 0^(z), and third, the distribution 
between 0\(z) and 0^{z) with P{Oi(z)} = p±, P{Os(z)} = p 3 where p\ and ps defined 
by (3.13), (3.14). 

tv. ai t * x f° M9,*)dg ^ f° A 3 (g,z)dg 

Theorem 4.3. Let \\ = — I r— < / — = A3, where 

JG{Oi(z)) ^i{9,z) Jg(0 3 (z)) B 3 (g,z) 



(5^0 
small enough h > 



Ai(g,z) and Bi(g,z) are defined by (4-6). Let lim<5 InT (A) = A > 0. Then for each 
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limlimP x {|^f (A) -Oi(z)\<h} = \ if 2)(x) G h and A < Ai, 



lim]imP x {|X^? m -0 3 (z)| < /i} = 1 i/ 2)(x) G J 2 and A > or i/ A > A 3 for any x G {F(x) = z}, 



5J.0 e4.o 



L T«(A) 



limlimP^lXif - Oi(z)\ <h}= Pi ,i£ {1,3} , »/ 2)(x) G J 3 and A < A x . 

54.0 s4-0 *• ' 

The probabilities p\ and p^, are defined by (3.13)-(3.14)- 

The proof follows from Theorem 4.1 and the fact that the transition time from 0\(z) 

to Os(z) (from 3 (z) to 0\(z)) for the process Y t on T is logarithmically equivalent as 
Ai I , f A 3 



5 I to exp <^ -i ^ (exp < t§ f ) (Theorem 4.4.2 in [14]). □ 

Remark: We assumed in Sections 3 and 4 that the function G(x) has in S(z) just 
one saddle point and two minima. We also assumed that the deterministic perturba- 
tions are friction-like. Then each minimum point become asymptotically stable for the 
perturbed system. It is not difficult to check that if G(x) has on the set S(z) = {F = z] 
(we assumed it has only one connected component) more than two minima points and 
several saddle points but just one local maximum, and the deterministic perturbations 
are friction-like, then the system can be regularized by an addition of stochastic per- 
turbations of the initial point or of the dynamics. Corresponding graph in this case has 
several interior vertices corresponding to the saddle points of G{x) and exterior vertices 
corresponding to the extremums. 

Inside each edge, the limiting slow motion is governed by corresponding equation 
(3.2). The exterior vertices are inaccessible in finite time. The limiting slow motion 
spends time zero at interior vertices, and the branching at each interior vertex occurs 
exactly as in the case of a unique saddle point. The branching at each interior point is 
independent of the previous behavior of the limiting slow motion. 

But situation is a bit different if G(x) has on S(z) more than one maxima or if the 
perturbations are not friction-like. In this case, in general, it is impossible to regularize 
the problem by a random perturbation of the initial point: the limit of 2)(X^' ) as e \, 
may not exist (compare with [4]). The regularization by stochastic perturbations of 
the equation, as we did in Section 4, is possible under mild additional assumptions. 
One should keep in mind that, if the deterministic perturbation is not friction-like, the 
stochastic branching occurs just at those interior vertices where there are two "exit" 
edges and one "entrance" edge (this means that the limiting slow motion along an edge 
attached to the vertex is, respectively, directed from or to the vertex). 
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Note that, since we assume that lira F(x) = oo, at least one local maximum of 

|x|— >oo 

G(x) is available on each connected component of every level set of F(x). 



5 Positive genus level set components 

Consider a slightly more general equation 

X t = VF(X t ) x d(X t ) , X = x{z) , (5.1) 

where the initial point x(z) belongs to one of the connected components M = M(z) 
of the level set {x 6 M 3 : F(x) = z}. As before, we assume that Fix) is smooth 
enough, lim F(x) = oo, and VF(i) 7^ for x £ M, so that M is a compact connected 

ir| — >oo 

orientable two-dimensional surface in M 3 . 

The vector field d(x), x G M 3 , is assumed to be smooth and the vector field V-F(x) x 
d(x) has, at most, a finite number of rest points on M. Moreover, assume that 



V x d(x) = for x e M . 

Note that in the case of equation (2.1), d(x) = VG(x), and the last assumption is 
satisfied. 

We will make use of the following 

Lemma 5.1. The measure on M(z) with the density with respect to the surface 
area proportional to is invariant for the flow (5.1) on M(z). 

Proof. Let us consider an auxiliary system 

l t = VF(X t ) x d( -^ ~ Q = iW 
|VF(X t )| 

which is a time change of system (5.1). Take any closed non self-intersecting curve 7 on 

M bounding a region .0(7) on M. Let the unit vector field ei be outward normal to 7, 

VF 

but tangent to M. Let e3 = . Let the unit vector field e2 be tangent to 7 and M: 

I V-T I 

e 2 = e 3 xej. We have 

VF \ 

x d • e\dl 



- mVFI 



® (d x 63) • e\dl 



i> d ■ (e3 x ei) dl = — <p d ■ e2dl = — / / V x ddm = 

'7 /y JJd(j) 
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(Here dm is the area element on M.) 

Therefore the flow of the auxiliary system Xt is incompressible (divergence- free) on 
M. Thus the standard surface area (induced by the metric element in R 3 ) is invariant 



for Xt- Since Xt is a time change of Xt with a factor |V-F(x)|, we see that the measure 

on M(z) with the density proportional to — — - is invariant for flow (5.1) on M(z). 

\vF(x)\ 

□ 



The topological structure of a compact two-dimensional orientable connected man- 
ifold M is uniquely determined by its genus. If the genus of M is zero, the condition 
V x d(x) = for M implies that d(x) = VG(x) for a smooth function G(x). Perturba- 
tion theory for such systems was considered in Sections 3 and 4. 

But in the case when M has higher genus, situation is more complicated. Let 
us consider, for example, the case when the genus of M is 1 so that M = T 2 is a 
two-dimensional torus. The general structure of an area preserving flow on a torus is 
described in [2] (Also see [19, Theorem 3.1.7]. Here not exactly the area is preserved, 
but a measure with strictly positive and bounded density. Then the structure of the 
trajectories is similar to the case of area-preserving systems on M): There exist finitely 
many domains Uk C T 2 (k = l,...,n), bounded by the separatrices of the flow, such 
that the trajectories of the dynamical system (5.1) in each Uk behaves as in a part of 
the plane: they are either periodic or tend to a point where the vector field is equal to 
zero. Outside of the domains Uk the trajectories form one ergodic class. Let this ergodic 

n 

class be £ = T 2 \ ( |J Uk) (here and below Uk is the closure of Uk)- Within each Uk 

fc=i 
the system (5.1) behaves like a standard Hamiltonian system with a Hamiltonian Hk- 

For brevity let us assume that each Uk contains only one maxima or minima of Hk and 

no saddles (the case when there is a saddle can be resolved using the results of previous 

sections). Let us denote the maxima or minima of Hk in Uk by M&. Let Ak be the 

saddles of (5.1) on T 2 : Ak is situated on the boundary of Uk- Let us introduce a family 

of functions hk(x) = Hk(x) — Hk(Ak) when x S Uk and hk(x) = when x 6 T 2 \ Uk, 

k = l,...,n. Let the set {x 6 Uk',hk(x) = hk} be jk(hk)- We notice that 7^(0) is the 

separatrix bounding Uk and containing Ak- 

Identify all points of the ergodic class £ as well as the points belonging to each 

level set of each function Hk(x), x G Uk- Let 2) be the identification mapping. Then 

2J(M), in the natural topology, is homeomorphic to a graph G. This graph is a tree, 

and 2J maps the entire ergodic class £ to the root of the graph which is denoted by 

O. Let 7fe(/i) = {x G Uk ■ hk(x) = h}. Define a metric p(yi,i/2) on G as follows: If 

V\ = ?)(7fc0i))> 2/2 = ?)(7K /l 2)), put p{yi,y 2 ) = \h - h 2 \ for k = I, and p(yi,y 2 ) = 

p(yi, O) + p(0, y 2 ) if k ^ I. In this way the region Uk will be mapped into a segment Ik 

of the form either [0, hk(Mk)] (if Mk is a maximum) or [hk(Mk), 0] (if Mk is a minimum). 
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All these segments I k serve as edges of our graph G and they share the common root O. 
Every point y = 2)(x) onG\0 can be given a coordinate (k, h k ) where k is the number 
of the edge containing y and h k = h k {x). In this way our mapping 2) is explicitly written 
as 2)(x) = O if x G £ and 2)(x) = (k, h k {x)) if x G t/jfc. 

Let us now introduce a deterministic perturbation and a stochastic regularization 
to our system (5.1). After the time change 1 1- > -, our perturbed system has the form 

X £ / = -VF(X £ ' 5 )xd(xf)+VF(X £ /)xp(xf)+5a(X e /)oW t , X £ ' 5 = x (z) . (5.2) 

Here p(») is a smooth vector field in M 3 and a is the same matrix defined in Section 
4. We remind the reader that o X7F = and a = (a,ij) = aa is the diffusion matrix. 
We also recall that we have the non-degeneracy conditions of a on M: e- (a(x)e) > a\e\ 2 
for some a > and all e such that e • X7F = 0. The process X s ' lives on the surface M. 

Let us define a strong Markov process Y t on G as the diffusion process on G 
governed by a generator A such that, at each interior point (k,h k ) of an edge I k , 
Af{k,h k ) = L k f(k,h k ), where 



-F *n , x 1 ( ix x S 2 n x & i, A df & 1 , n n &* f 

LkJ{k,h k ) = a k (h k ) + —a lyk (h k ) + —a 2yk {h k ) —— + — - r b k (h k " 



T k (h k ) \r KV ' K/ 2 " 1 '* v ""' ' 2^" V ""V a/i fc ' 2T k {h k Y KV " K 'dhl ' 

(5.3) 



with 



a fc (/ifc) = / Vi? fe • (VF x p) 
oi,fc(/»fc) = Vi7 fc • E 



7 fe fe) ' Vi? X d ' 



dl 

dl 



iM iVF x d 



a.2,k(h k ) = f ^2 



1 ^ efl (5-4) 



«, 



-,,,/„)—! "^asj |VFxd| 

T 1 1 2 



b k (h k )=l \(VH k fa 

J"/i.(hi.) 



and 



'Tfc(fcfc) |VF X d| ' 



7fc(/lfc S 



7^) l Vi? X d l 



is the period of one rotation along "f k (h k ). Here the vector I] is the same vector as in 
Section 4. 

The domain D{A) of A consists of those functions / that are continuous on G and 
have the following properties. 

• Function / is twice continuously differentiable in the interior of each of the edges. 
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• We have the one sided limits lim L k f(k,h k ) and lim L k f(k,h k ) at the 

h k ~^0 ' ' h k ^th k (M k ) 

endpoints of each of the edges. The values of the limit q = lim L k f(k, h k ) are the same 

h k ^0 

for all the edges. 



The following gluing condition is satisfied at O: 

df_ 

dh k 



f>)A lim Jf(k,h k ) = q, (5.5) 

f— ' h k ^o dh k 

k=i 

with sign + if A k is a local minimum of H k restricted on U k and sign — otherwise. Here 

dl 



& = 7tW/ l(V# fc )V 

W J~y k (0) 



X(£)J lk( o) ,y ' ' |VFxd| 

with 

\(£)= n dm 



£|VF| 



( Here dm is the area element on M.) These conditions define the process Y t " on G in 
a unique way. 

We have the following 

Theorem 5.1. The process Y^' = %)(X £ ' ) converges weakly in the space of 
continuous trajectories [0,T] — >■ G as e \. to Y t . 

The proof of this theorem is an application of Theorem 1 of [6]. To be precise, 
in formula (5) of [6], we set k = 5 2 , v{X £ /) = VF{X £ ' S ) x d(X £ t ' 5 ), /3(X £ t ' S ) = 

VF(X £ ' ) x p(X 6 ' ), u(X 6 ' ) = (a term which comes from the Stratonovich 

integral), o~(X 6 ' ) = o~(X £ ' ). Furthermore, we can write down the generator L of X £ ' 
in self-adjoint form 

3 do~kj 

Here II is a 3-vector with the i-th component Hi = Yl ~R — °~ij- Notice that since 



hk- 



'--i dx k 



do- k j sr^ OF 



a T VF = 0, we have VF • II = N — — - y o~a — — = 0. Also notice that since we have 

^ OX k f-f OXi 



j.k=l v i=l 
is a co 
Therefore, we have CF(x) = where 



checked the fact that F(X 6 ' ) is a constant of motion, Ito's formula imply LF(x) = 0. 



Cu 
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From here we see that the auxiliary process X^' , Xq = xq(z) corresponding to the 
operator C lives on the surface M = M(z). Since C is self-adjoint in M 3 , the (degenerate) 
process X^' has an invariant measure proportional to M 3 Lebesgure measure. This 
implies, that the process X^' , viewed as a non-degenerate diffusion process on M, has 

a unique invariant measure with density proportional to — — - (with respect to the 

|VF(x)| 

surface area element dm on M). Since we have checked that the deterministic flow (5.1) 

on M also has an invariant measure with density proportional to r^— , we see that 

\\F(x)\ 

the auxiliary process Xf , Xf = xq(z) governed by the operator 



£u = — > — > an—- + -VF x d • Vn 




is a non-degenerate diffusion process on M with a unique invariant measure which has 

a density proportional to rrr- This fact, together with the standard method of 

\\/F(x)\ 

absolutely continuous change of measure (see [11] and compare with Appendix. 2), allow 
us to calculate the gluing condition (5.5). 

Since the small random perturbation term 5a o Wt in (5.2) is only introduced as a 
regularization, we must study the limit of Y t as 5 \. 0. It follows from the same argument 
as in Section 3 of [6] that the limiting process Yt should be described as follows. Let 

r ai 

^ k = 2j> VH k - (VF x p) + . 

J lk (o) IV* x d| 

Let Sfc, 1 < fc < n, take values and 1. We set s k = 1 if ipk > anc ^ ^k is a local 
maximum of H k as well as if ip k < and M k is a local minimum of H k - Otherwise we 
set s k = 0. Let 

r k = S -^f4,Kk<n. 
k 2\(£) ' - " 

Then we can describe Yt as follows. 

• The process Yt is a strong Markov process with continuous trajectories. 

• If Yq = O, where O is the root of G, then the process spends a random time r in O. 
There is a random variable £ that is independent of r, taking values in the set {1, ..., n}, 
such that Yt £ 1^ for t > r. If s k = for all k, 1 < k < n then r = oo. If s k = 1 for 

n 

some k then r is distributed as an exponential random variable with expectation ^ r k . 

k=l 

If s k = 1 for some k then 

P(Y t e I k ,t > r) = ^- . 

k=l 
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• If lb £ Intifc then 

alt v ; 

for t < a where a = inf(i :Y t = 0) and B fc (fc fc ) = J^^- . 

Theorem 5.2. As 5 J, 0, £/ie process Y t converges weakly in the space of continuous 
trajectories [0, T] — > G ; to i/ie process Y t . 

The proof is an application of Theorem 2 in [6]. (See the explanation in the proof 
of Theorem 5.1.) 

In the more general situation when the surface M has higher genus, the situation 
is similar (compare with [7]). In particular, corresponding graph may be not a tree; 
it can have more than one special vertices where the limiting Markov process spends 
random time with exponential distribution; transitions between those special vertices 
are possible. 



6 Multiscale perturbations 

Equation (2.1) has two first integrals F(x) and G(x). These integrals may have 
different nature and their perturbations may have different order. Consider the case 
when the perturbed system has the form 

xr = vF(xn x vc(xn + vw^ 5 ) * w? + ^ w> * w? , 

e, k > , Xq' k = x G M C {y € K 3 : F(y) = z} , 

where M is a connected component of the level set {F(x) = z}; o~i(x) and o~2(x) are 
3x 3-matrices; W} and Wf are independent white noises in R 3 . Put a±(x) = ai(x)o~i (x) , 
a>2{x) = CF2{x)o~2{ x )- Sign "*" in the stochastic terms means that the stochastic integrals 
are defined in such a way, that the generator of the process X^' K is as follows 

L £ ' K u(x) = (VF(x) x VG(x)) • Vu(x) + ^div(oi(x)Vu(x)) + |div(o 2 (x)V«(x)) . (6.2) 

We assume that ai(x)VF(x) = and e • (ai(x)e) > a 1 |e| 2 for each e such that 
e • V-F(x) = 0, a_i is a positive constant. The matrix a 2 (x) is assumed to be non- 
degenerate. The assumptions concerning ai(x) imply that the process X 4 ' K moves on 
the surface M: ~P{X t ' K £ M} = 1. This follows directly from the Ito formula (we refer 
the reader to the proof of Theorem 5.1 in Section 5, where we did a similar calculation). 
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Moreover, the process X t ' K on M is non-degenerate. This implies that, for any k > 0, 
the process X t ' K has on the compact manifold M (we assume that lim F(x) = oo) 

|x|— >OD 

a unique invariant measure. On the other hand, the drift in (6.2) is divergence-free 

and the main part is formally self-adjoint. Therefore the Lebesgue measure is invariant 

C 
for the process X^' K , and in particular for X t ,K , in 1R 3 . This implies that 



dm 



\VF(x) 



C = / ,,_,,-,, r: , where dm is the surface area on M, is the density of the unique 

\Jm NF(x)\J 

invariant measure of X t ' on M for each k > 0. 

Assume that 0<£<<k<1. This means that we have relatively large perturba- 
tions of the first integral G(x) and much smaller perturbations of F(x). On the time 
intervals of order — , one can omit the term y/e<T2(x) * Wf in (6.1): the first integral 
F(Xl ,K ) does not change on such intervals as < e « k « 1, and the evolution of 
G(Xf ,K ) asymptotically coincides with the evolution of G{X t ' K ) and can be described 
using the results of Section 4. 

But on time intervals of order — > — , the situation is different. Consider process 

£ K 

Xl' K = X^', K . The process X^' K is governed by the generator —L e,K = L e,K . It has a 
fast and a slow components as e \. 0. The fast component of the process Xf K can be 
approximated by the process X t corresponding to the generator 

^ e ' K 1 K 

L u(x) = -(VF(x) x VG(x)) • Vu -\ div(oi(x)V«) . 

^ e ' K t 

The process X t lives on the surface M and, up to a simple time change t — >■ -, coincides 
with X t ' K . In particular, it has the same invariant density C\\7F(x)\~ 1 . 

To describe the slow component of X^ ,K , one should introduce a graph. Identify 
points of each connected component of every level set of the function F(x). Let 2) be 
the identification mapping. Then the set 2)(M 3 ) is homeomorphic to a graph provided 
with the natural topology which we denote by V. 

Note that all connected components of level sets not containing critical points of 
F(x) are two-dimensional compact (we assume that lim F(x) = oo manifolds). Each 

ie|— >oo 

local maximum or minimum of F(x) corresponds to an exterior vertex belonging just 
to one edge. The saddle points correspond to the interior vertices. Unlike in the case 
of generic functions of two variables, not every interior vertex belongs to three edges: 
If O is a saddle point of F(x), the surface {?/ £ R 3 : F(y) = F(,0)} divides each small 
neighborhood of O in three parts. But two of these parts, in the case of functions of 
three variables can come together far from O (compare with [12]). One can introduce a 
global coordinate system on T: Number the edges of I\ Then each point y £ T can be 
identified by two numbers k and z, where k is the number of an edge containing y and 
z = F(2)" 1 (y))- 
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The slow component of X\ ,K is the (not Markovian, in general) process 2)(X^' K ) = 
Y t £ ' K on T. 

Define a diffusion process Y% on T which inside each edge I k C T is governed by an 

ordinary differential operator L k = — —(ak(z)-r), where 

2Tk(z) dz dz 

I dim f 

T k (z) = / — — , a k (z) = / div(a 2 VF(x))dx , (6.3) 

J^- 1 {k,z) l Vi< Wl JG(k,z) 

where G(k, z) C M 3 is the domain bounded by the surface 2) _1 (A;, z); a 2 (x) = ^(x^^x), 
dm is the area element on 2) _1 (/c, z). 

The operators L k define the process Yj inside the edges. To define the behavior of 
Yt at the vertices, we describe the domain Da of the generator of Yt (see Ch.8 in [14]). 
We say that a continuous on T and smooth inside the edges function / G Da if and only 
if the following holds. 

• The function defined inside the edges by the formula L k f(k,z) can be extended 
to a continuous on the whole graph function. 

• If edges Ijj, Ii 2 , Ii 3 are attached to an interior vertex O, then 



J2(±)ai k (O)D k f(O) = 0, 



k=l 

where a,i k (0) = lim a,i k (z) (~d~i(z) is defined by (6.3)), and 

z->F(0) 

D4/(0) = , /(M)-/^(0)) 

z~>F(0) z - F(0) 

(compare with [12]). The sign convention in the gluing condition is as follows: Let 
^THJii) belong to the set {x G M 3 : F(x) > F(0)}, and 2) _1 (/j 2 ), 2) _1 (^ 3 ) C {x G 
IR 3 : F(x) < F(O)}. Then sign + should be taken in front of 0,^(0) and sign — in front 
of a,i 2 (0) and ai 3 (0). 

• If just two edges 1^ and Ii 2 are attached to an interior vertex O, then .0^/(0) = 
D l2 f(0). 

For functions f(k, z) with these properties, Af(k, z) = L k f(k, z). These conditions 
define the Markov process Yt on V in a unique way. Exterior vertices are inaccessible 
for Y t . 

Theorem 6.1. The process Y^' K = 2)(X^' K ) converges weakly in the space of 
continuous functions [0, T] — > T for each finite T > as e 4- to the (independent of k 
and a\{x)) process Yt defined above. 
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The proof of this statement follows from Theorem 2.1 of [12]. We omit the details. 
Using the absolute continuity arguments which we mentioned earlier one can consider 
more general perturbations in (6.1). 



Appendix 

1. We provide here the proof of Lemma 3.1. By a similar calculation as we did 
before stating Lemma 3.1 we have 



L(a(z),b(z)) Areafp! 



<S 1 (X) + S 2 (s) 



(A.l.l) 



L(b(z),c(z)) Area(D 2 ) 
(Here and below we use symbol 5k(p) to denote a positive quantity which goes to zero 
as the parameter \x 1 0.) 

We can also check, by mean value theorem, that 



Area(Di 



□i 



Area(D 2 ) 



-VF xVG + VF x (VF x b) 



-VF xVG + VF x (VF x b) 



By (3.3), it is easy to check that 



dm 



dm 



< S 3 (Xe) . 



(A.1.2) 



□i 



VF x VG + VF x (VF x b; 



dm 



VFxVG + VF x (VF x b; 



dm 



Si(z) 



V x (VF x b) • ndm 



Sa(*) 



V x (VF x b) • ndm 



< <5 4 (A) . 



(A.1.3) 

_ . , . ..... ... . . , . Area(Di) . 

By using the averaging principle, it is possible to show that the ratio is 

Area(D 2 ) 

asymptotically preserved along the flow of (2.7) (compare with [4]). Therefore we can 
take Di and D 2 as close to the separatrices hitting and exiting 2 (z) as we wish. This 
fact, together with the estimates (A.1.1)-(A.1.3), imply our Lemma 3.1, by letting first 
A i and then e 1 0. 



2. We explain here the missing details in the proof of Theorem 4.1. As we have 
explained in that proof, our process Z t ' satisfies the equation 



Zf = - £ K(zf)Vg(zf)+^(Z^)+^zf)+5^Z^)W t , Zf = z = (/i(*o(*)),/2(*o(*))) 



(A.2.1) 
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Here the term — c(Z|' ) comes from the Stratonovich integral in (4.9). 

As before, we can identify the connected components of the level sets of the Hamil- 
tonian Q to obtain a graph I\ Let 2) be the identification mapping. Let us use the same 
symbols to denote vertices and edges as those we use for the graph corresponding to 
X t £ ' (see Section 2). 

System (A.2.1), by a non-singular time change, can be reduced to a perturbed 
Hamiltonian system with Hamiltonian Q. The form of the operators governing the 
limiting diffusion inside the edges is obtained by standard averaging. To get the gluing 
conditions, we first consider an auxiliary process 



Z t 



.e,<5 



?£,<5\ 



K(zr)vg(zr)+sa(zr)w t , ^ 



e,S 



Z() 



(A.2.2) 



Such a process, by a non-singular time change, is equivalent to a perturbed Hamil- 
tonian system which has Lebesgue measure as its invariant measure. Using this fact, via 
a standard proof of [14, Chapter 8, Section 6], we conclude that the gluing condition for 
the weak limit of 2)(Z t e ' ) as e 1 at vertex Oi{z) is given by the coefficients 



/?: 



2A 



f(Ci(0,z)) 



\iVG) T a\ 2d k 
|j0| 



for z = 1,2, 3. Here /3 = kVQ. 

The measure // e ' corresponding to Z\' (0 < t < T) is related to the measure //' 



7 e,8 



corresponding to Z t ' (0 < t < T) via the Girsanov formula 



dfi; 



6,6 



dp 



S = I e J = exp \ - J a-\zf)[Mzf) + -c(Z £ /)] • dW t - 



25 2 7 



T 



-ls^e,S 



S^ 



a-\Zr)[p l {Zr) + -Z{Zt )] 



dt> ■ 



Lemma A.2.1. There exist constants A\ > 0, To > such that E zo (/q^ — l) 2 < 
AiT for allT < T . 

To prove this lemma, we first apply Ito's formula to (/^ — l) 2 and taking expected 
value. After that we use the fact that 



E zo exp { - 



T a-\zf)[Uzf) + b \~<zf)\ ■ dW t - 



2 



a-\Z e /)[Mzf)+ 5 -c{zf)] 



dt 
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and the Cauchy-Schwarz inequality. The proof is essentially the same as that of Lemma 
2.3 in [11]. 



For small A > we let 



D 2 (\) = {x£R 2 -.g(x)£ [-A,A]} . 



For i = 1,3, we let 



Z).(A) = {xeR 2 : g(f(Oi(z))) < Q(x) < g(f(Oi(z)))+X , x is in the well D { (0,z) containing Oi(z)}. 
For k = 1, 2, 3 we let 



rf(A) = inf{t>0,Z t £ 'V^(A)} 



We have 



Lemma A. 2. 2. For any positive [i > and k > £/iere exists Ao > such that 
for < A < Aq for sufficiently small e and all x G Z?2(A) 



E, 



r|'*(A) 



exp(—fit)dt < k\ 



and for all x G Dj(A) (« = 1,3,) we /iawe 

E Zo / exp(— fit)dt < k . 

Jo 



The proof of this Lemma is based on corresponding estimates for the process Zl' 
and Lemma A. 2.1. It is essentially the same as that of Lemma 2.4 in [11]. 



Lemma A. 2. 3. Let qi 



h,i 



where i = 1,2,3. We have, for any k > there 



t=i 

exist Ao > such that for < A < Ao there exist A' > such that for sufficiently small 
e we have 



P Z0 {Z^s (x) &C l ({-iyX,z)}-q t 



< K 



for allx£D2(X')UdD 2 (X'). 



The proof of this Lemma is also the same as that of Lemma 2.5 in [11]. 
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The slow component ft)(Z^' ) of the process Z\' converges weakly as e \. to a 
diffusion process Y t on T. The process Y t is defined by a family of differential operators, 
one on each edge of T, and by gluing conditions at the vertices. The operators and gluing 
conditions were calculated in Chapter 8 of [14]. The convergence of 2)(Z|' ) to Y t was 
also proved in [14]. 

To find the weak limit of the slow component 2)(^' ) of Z%' as e \. 0, note that 
the family 2)(Z t e ' ) is weakly compact as e 1 0. Inside each edge, the limit is a diffusion 
process with the generator defined by the standard averaging principle. The limiting 
process %{Z\' ) and %)(Zf' ) inside an edge, in general, are different. But as it follows 
from Lemmas A.2.1-A.2.3, the gluing conditions are the same. This implies that the 
family 2)(^' ) converges weakly as e 1 and identifies the limiting process as the process 
Y t in Theorem 4.1. 

3. We indicate here how to calculate the branching probabilities as claimed in 
Theorem 4.2. Let Y t be the diffusion process on graph T described in Theorem 4.1. Let 



£h(u) = {v £ T : p(u, v) < h} for u £ T, 



T S h =mm{t:Y t s ££ h ( U )} . 



Let pi and p^ be defined as in (3.13) and (3.14). We have 



Lemma A. 3.1. We have, for a small enough h, 



5s>iwnh/.>-o 



So P °^ Y r S h G ^ = Pi fOT i = 1; 3 



To prove this Lemma, we let u = (g,i) £ £ft(02(z))- We set vAu) = vAg,i) = 
Pr g {\{Y s G Ij}. The function vAg,i) is the unique continuous solution of the following 

h 

problem 



f L i v 6 j (g,i) = 0, (g,i)££ h (0 2 (z))\{0 2 (z)} , »= 1,2,3, 

v j(9,i)\( 9 ,i)€d£ h (02(z))nh = for i ^ j , 
Vj(g,j)\(g,j)eds h (o 2 (z))ni, = 1 , 

3 dv 5 

V(±)/3 2jfc lim -i-( 5 ,fe) = 0. 

£-j 9 ^ G( o 2 ( Z )) ^ 
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Here Li are defined in (4.5) and /?2,i are defined in (4.7) and (4.8), with " + " sign 
for k = 2 and " — " sign for k = 1,3. One can solve this problem explicitly and derive 
the statement of Lemma A. 3.1 similarly to Lemma 2.2 of [4]. 
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